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FIELD OF THE INVENTION 

The present invention relates to a video communication system, which is able to 
5 receive a digital video signal, according to the preamble of claim 1. The invention further 
relates to a method of processing a digital video signal according to the preamble of claim 8. 

Such a system may be used, for example, for 3D video applications within MPEG 
standards. 



10 BACKGROUND OF THE INVENTION 

A video commimication system typically comprises a transmitter with an encoder and 
a receiver wifli a decoder. Such a system receives an input digital video signal, encodes said 
signal via the encoder, transmits the encoded signal to the receiver, then decodes the 
transmitted signal via the decoder resulting in an ou^ut digital video signal, which is the 

15 reconstructed signal of the input digital video signal. The receiver then displays said output 
digital video signal. A digital video signal comprises some sets of objects, which are 
characterized by components such as shapes, textures, motion information, disparity map (in 
the case of 3D video signal), etc. 

When an object is encoded, the components of said object are encoded. The encoding 

20 and decoding processes of each video component can more or less depend on each other. 
Coding performance relies on this inter-dependence. 

Let us give an example with 3D video objects encoded by using the MPEG-4 
standard. In this document referred to under the MPEG-4 document number w3056 at ISO 
and entitled "Information Technology - Coding of audio-visual objects - Part 2: Visual, 

25 ISO/DBC JTC 1/SC 29 /WG 11, Maui, December 1999", when one encodes a stereo video 
sequence comprising two views, one view — say the left one — and the corresponding 
disparity maps (or depth maps) are encoded. The right view is reconstructed by projecting the 
left view using the disparity map. Parts of the right view that were not in the left one 
(occlusion parts) cannot be reconstructed and holes remain. In order to reconstruct the right 

30 view properly, one encodes the occlusion parts retrieved fi'om the original right view as an 
MPEG-4 video object, whose shape corresponds to the holes. If the left view is encoded at a 
bit rate of 3.5 Mbit/s, the enhancement layer defined to handle occlusion parts should not be 
larger than 10% of this bit rate. Here it takes 340 kbit/s. Occlusion objects are defined by 
shape and texture. Using the standard way of coding objects, their bit rate costs are: 
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Shape: 93 kbit/s i.e. 27% of the bit rate allocated to occlusion 

Texture: 128 kbit/s i.e. 37% of the bit rate. 
The occlusion shapes can be determined from the disparity map. Still, component encoding 
being interdependent, it is not possible to efficiently encode the texture of the occlusion parts, 
5 without encoding the shapes. Therefore, the bit rate cost of the video sequence is not optimal. 

OBJECT AND SUMMARY OF THE INVENTION 

Accordingly, it is an object of the invention to provide a video communication system 
as defined in the preamble of claim 1 and a method as defined in the preamble of claim 8, 
10 which lower the bit rate needed to encode objects. 

To this end, according to a first object of the invention, there is provided a video 
communication system as claimed in claiml. 

In addition, according to a second object of the invention, there is provided a method 
15 as claimed in claim 8. 

As we will see in detail below, by providing the possibility of not encoding a first 
component and therefore not transmitting it while encoding the other components as if it was 
the case, the encoding efficiency will be improved, because fewer bits will be necessary to 
20 encode the whole object 

BRIEF DESCRIPTION OF THE DRAWINGS 

Additional objects, features and advantages of the invention will become apparent 
upon reading the following detailed description and upon reference to the accompanying 
25 drawings in which: 

- Fig. 1 illustrates a video commimication system comprising an encoder and a decoder 
according to the invention. 

Fig. 2 is schematic diagram of the encoding method used by the encoder of the video 
commimication according to the invention, and 
30 - Fig. 3 is schematic diagram of the decoding method used by the decoder of the video 
communication according to the invention. 



DETAILED DESCRIPTION OF THE INVENTION 
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In the following description, functions or constructions well known to the person 
skilled in the art are not described in detail because they would obscure the invention in 
unnecessary detail. 

The present invention relates to a video communication system for processing a 
5 digital video signal. 

Such a system, depicted in Fig. 1, may be used for video applications in MPEG2 or 
MPEG4, wherein said video communication system comprises a transmitter TRANS, a 
transmission medium CH and a receiver RECEIV. Said transmitter TEIANS and said receiver 
RECEIV comprise an encoder ENC and a decoder DEC, respectively. 

In order to transmit efficiently some video signals through the transmission medium 
CH, in which the transmitted bits of the video signals are known as bit stream BIT^STR, said 
encoder ENC applies an encoding on a video signal, then the encoded video signal is sent to a 
decoder DEC, which decodes said signal. Finally, the receiver RECEIV displays said video 
IS signal. 

A video signal comprises some sets of objects OBJ with dififerent components COMP 
such as shape, texture, motion vectors, disparity map, colors, etc. 

When the encoder ENC encodes an object OBJ, it encodes effectively all the 
components COMP of said object OBJ. Much encoding of components is dependent from 
20 other components. For exanrple, in the INTER mode scheme, well known to the person 

skilled in the art, texture ioformation can be used only if we have motion information. When 
it comes to video object coding, in MPEG-4 using the block-based principle, texture block 
positions are determined by shape, and knowing this shape allows improvement of coding 
efficiency by using spatial redundancy between co-located texture blocks. 

25 

The encoder ENC comprises availability information FLAG for determining if a 
component COMP of an object is to be encoded, or not, encoding means for encoding said 
component COMP if said availability information FLAG is enabled, and transmission means 
for transmitting said component COMP of said object OBJ if the availability information 
30 FLAG is enabled. 

The decoder DEC comprises decoding means for decoding a component COMP if the 
availability information FLAG is enabled and retrieving means for retrieving said component 
COMP if the availability information FLAG is disabled. 
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The encoding of an object OBJ is done by the encoder ENC as follows, illustrated in 

Fig. 2. 

For each component COMP of the object OBJ, the encoder ENC decides if said 
component COMP will be included in the bit stream BIT^STR or not. Decision depends on 
5 the type of video application: for example, in a stereo system where one encodes the 
occlusion part, one knows that there is no need to encode and transmit the shape of these 
occlusions. 

Availability information FLAG is assigned to each component COMP of the object 
OBJ. If a component COMP is to be included in the bit stream, i.e. encoded and transmitted, 
10 the availability information FLAG is enabled. Preferably, this availability information FLAG 
is placed at video object level (VO in MPEG-4). 

In a first non-limitative embodiment, this availability information FLAG has two 
values: 1, when enabled, and 0 when disabled 

In a second non-limitative embodiment, this availability information FLAG has an 
15 extended syntax: 1, when enabled, and 0 plus a description codeword when disabled. The 
description codeword tells how to retrieve the missing component COMP (wait for extra 
input, like a pre-computed shape, wait for extea input and n*** component information, etc. . .). 
Afterwards, tiie decoder DEC will use this description codeword to retrieve a component 
COMP. It is supposed that at the decoder side, the way to retrieve a missing component 
20 COMP is well known and retrieval algorithms are available. 

Subsequently, the encoder ENC encodes all the components COMP (step 
ENC_COMP in Fig. 2) that will be included in the bit-stream BIT_STR and for the 
components COMP_D, which need the missing components COMP_M, it encodes them as if 

25 all the components COMP of the object OBJ and especially the missing components 

COMP_M have been used, encoded and transmitted as well. Indeed, some of the coding 
components may require the use of other components as described before. 

Note that the availability information FLAG and potentially description codeword 
(depending on the kind of availability information used in the application) are defined for 

30 each object components COMP. The encoded video object OBJ then corresponds to all these 
availability information FLAG, along with the included encoded components COMP. Hence, 
the bit stream BIT_STR that is transmitted to the decoder comprises the availability 
information FLAG of the missing components, and for every other component, their 
corresponding availability information FLAG and the encoded component COMP itself 
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Note that the following hypothesis has been made. We suppose tiiat we have an 
encoding algorithm where video objects OBJ fall into several components and that they 
correspond to different, separable parts of the bit stream BIT_STR. It means that it is possible 
5 to replace one component without preventing the decoding process of another component 
For example, in MPEG-4, if we encode a video sequence with texture and depth information, 
if we replace or modify the encoded texture information, it is still possible to decode depth. 
Furthermore, if we change the motion vector components, it is possible to decode tiie texture 
- even though we have a sequence that is visually different firom the original, but the encoded 
10 texture information is still correct 

Said encoded bit stream BIT_STR is then sent to the decoder DEC of the receiver 
RECEIV, via the transmission medium CH. 

15 The decoding of the encoded bit-stream BIT_STR is performed by the decoder DEC 

as follows and as illustrated in Fig. 3. 

When receiving the encoded bit-stream BIT^STR, the decoder DEC first checks the 
availability information FLAG ia said bit-stream BIT_STR. If availability information FLAG 
is set to 1, it waits for the corresponding components COMP in the bit-stream BIT_STR, 

20 which follow their assigned availability information FLAG. Then, it decodes them. It first 
decodes the components COMP Hiat do not need any other encoded components or missing 
components COMP_M (step DEC_COMP m Fig. 3), and then the components COMP_D that 
are dependent on the other encoded components and on the missing components COMP_M. 
For these later dependent components COMP_D, the decoder DEC retrieves the missing 

25 components COMP_M (step RETR_COMP_Ml in Fig. 3) and then decodes these dependent 
components COMP_D (step DEC_COMP_D in Fig. 3). The dependence relation between 
components is application dependent: in a stereo video decoder, we could say that when 
decoding the occlusion shape, one uses the decoded disparity map - this could be done by 
introducing a description codeword meaning "Retrieved from Disparity". To take a more 

30 general case, in the MPEG coding scheme, texture information is retrieved from motion 
vector and a previous frame. 

Once all of the encoded conq)onents COMP are decoded, the decoder DEC retrieves 
the last missing components COMPJM (step RETR_COMP_M2 on the Fig. 3). Note that this 
last step can also be performed at any other time, for example, at the beginning. 
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The bit stream BIT_STR is as if none of the components has been removed. The 
decoding process, well known to the person skilled in the art, is possible, as if every 
component has been regularly encoded and transmitted. 

5 Thus, one advantage of the present invention is to simply decrease the bit rate used to 

encode an object by only using availability information FLAG, and to simply informing the 
decoder that it needs to retrieve some components itself. 

It is to be imderstood that the present invention is not limited to the aforementioned 
10 embodiments and variations and modifications may be made without departing firom the 
spirit and scope of the invention as defined in the appended claims. In this respect, the 
following closing remarks are made. 

It is to be understood that the present invention is not limited to the aforementioned 
video application. It can be used within any application using a system for processing a signal 
IS that is decomposed into several components, the encoding of which results in separated bit- 
stream parts. In particular, the invention applies to video compression algorithms of the other 
MPEG standards family (MPEG-1, MPEG-2) and to the ITU H26X family (H261, H263 and 
extensions, H261 being the latest today, reference number Q15-K-59). 

20 It is to be understood that the method according to the present invention is not limited 

to the aforementioned implementation. 

There are numerous ways of implementing functions of the method according to the 

invention by means of items of hardware or software, or both, provided that a single item of 

hardware or software can carry out several functions. It does not exclude that an assembly of 
25 items of hardware or software or both carries out a function, thus forming a single function 

without modifying the method of processing the video signal in accordance with the 

invention. 

Said hardware or software items can be implemented in several manners, such as by 
means of wired electronic circuits or by means of an integrated circuit that is stutably 
30 programmed. The integrated circuit can be accommodated in a computer or in a video system 
communication. In the second case, the video system communication comprises encoding 
means for encoding a component of an object if the availability information is enabled, 
decoding means for decoding said component of said object if the availability information is 
enabled, and retrieving means for retrieving said component of said object if the availability 
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information is disabled, as described previously, said means being hardware or software 
itenos as stated above. 

The integrated circuit comprises a set of instructions. Thus, said set of instructions 
comprised, for example, in a computer programming memory or in a video communication 
system may cause the computer or the video communication system to carry out the dififerent 
steps of the encoding method. 

The set of instructions may be loaded into the programming memory by reading a 
data carrier such as, for example, a disc. A service provider can also make the set of 
instructions available via a communication network such as, for example, the Internet. 

Any reference sign in the following claims should not be construed as limiting the 
claim. It will be obvious that the use of the verb "to comprise" and its conjugations does not 
exclude the presence of any other steps or elements besides those defined in any claim. The 
article "a" or "an" preceding an element or step does not exclude the presence of a plurality 
of such elements or steps. 



